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EXPLICIT SUBSTITUTIONS AND ALL THAT 


MAURICIO AYALA-RINCONt AND CESAR MUNOZ* 

Abstract. Explicit substitution calculi are extensions of the A-calculus where the substitution mechanism 
is internalized into the theory. This feature makes them suitable for implementation and theoretical study of 
logic-based tools such as strongly typed programming languages and proof assistant systems. In this paper 
we explore new developments on two of the most successful styles of explicit substitution calculi: the A <j- 
and As e -calculi. 

Key words, explicit substitution, higher order unification, lambda-calculus, type theory, rewriting 

Subject classification. Computer Science 

1. Introduction. This paper focuses on the uses of explicit substitutions in the language of the simply- 
typed A-calculus. Type theories were used at the beginning of the twentieth century as a formalism to deal 
with the mathematical paradoxes studied at that time and incorporated in 1940 to the A-calculus by A. 
Church [11]. The need of stronger programming languages guided type theory to the interest of computer 
scientists in the 1970’s and 1980’s, when new languages based on type theories were developed. Probably 
the most relevant of these languages is ML [42], developed by R. Milner. In the 1990’s, several proof 
assistant systems based on higher-order logics, such as Coq [5], HOL [27], and PVS [51], were developed. 
The A-calculus is the simplest logical framework for reasoning about formal properties of all these systems. 
Many of the essential techniques and computational procedures involved in these tools have been developed, 
analyzed, and improved in the context of the simply-typed A-calculus before being implemented. These 
techniques include simple mechanisms such as type checking and type inference, and more complex ones 
such as those used for dealing with the inhabitation problem and the higher order unification problem. The 
basic operation of the A-calculus is the ^-conversion that was originally defined based on an implicit notion 
of substitution where renaming of variables was informally assumed to avoid “clashes” and “captures” . This 
implicitness of the notion of substitution was not critical before this theoretical framework was used in other 
contexts than the ones of computer science, but making the notion of substitution explicit is essential when 
computational properties such as time and space complexity should be analyzed. 

We will focus on two styles of explicit substitutions: A a and As e . These calculi use a name-less notation 
for variables. Therefore, technical nuisances due to the higher order aspect of A-calculus, such as renaming 
and capture of variables, are minimized or completely eliminated in A a and As e . For these calculi, we will 
motivate and illustrate different techniques developed for important computational problems and applications 
such as higher order unification, type inference, and inhabitation problem. These kind of problems arise 
naturally in many fields of computer science. Some of the current progress in the area of explicit substitution 
is recorded in the series of “International Workshops on Explicit Substitutions: Theory and Applications to 
Programs and Proofs” - WESTAPP that runs yearly together with the Conference on Rewriting Techniques 
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and Applications - RTA. For other surveys and tutorials on explicit substitution calculi see [38, 56]. 

Firstly, in section 2 we present basic notions of the A-calculus, its representation in de Bruijn index 
notation, its simply-typed version, and the Curry-Howard isomorphism. Afterwards, in section 3, we motivate 
explicit substitutions and present the two before mentioned calculi of explicit substitutions along with their 
simply-typed versions. In section 4, we explain briefly the applications of explicit substitutions before 
concluding in section 5. 

2. The A-calculus. The A-calculus was developed by Church around 1930 [12] as a formal language for 
the foundations of mathematics and logic. Although that foundation was later revealed to be inconsistent, 
indeed Russell paradox [58] can be encoded in it, the A-calculus still provides a formal model of computability. 
Church and Kleene [37, 10] proved that the class of A-expressions and the class of partial-recursive functions 
are the same. This result, along with Turing’s own work, shows that the A-calculus is as expressive as Turing 
machines. 

The notation consists of a set A of terms and rules to manipulate them. The set A is built on a countable 
set of variables V = {x,y, . . .} and it is inductively defined as follows: V C A, if M,N e A then (M N) £ A, 
and if x £ V and Me A then A x.M £ A. Terms of the form (M N) are called applications and terms of the 
form A x.M are called abstractions. Abstractions are binding structures. As usual for these kind of structures, 
a notion of free and bound variables is necessary. The set of free variables of M, denoted TV(M). is defined 
by TV(x) = {x}, TV((M AO) = TV(M) U TV{N), and TV{\. x.M) = TV(M) \ {x}. The variable x in 
a term A x.M is said to be bound. Names of bound variables are irrelevant. For instance, Ax.x and A y.y 
represent the same A-term. This implicit equivalence is called ct-con version. Formally, if z £ J TV(M), then 
A x.M = a A z.M{z/x}, where for an arbitrary term AT, M{N/x} denotes the atomic substitution of the free 
occurrences of the variable x in M by N. 

Substitution plays a very important role in the A-calculus. In fact, the main computational rule in this 
formalism, the /3-rule, is expressed as follows: (A x.M N) — M{N/x}. Informally, it states that the 
application of a function A x.M to an argument N , results in a. term M{N/x} where the formal parameter x 
has been replaced by the argument N in M (the body of the function). An additional rule, called rj, states that 
abstractions computing the same value for the same argument are convertible. Formally, A x.(M x) — M, 
if x <£TV(M). 

The formal definition of substitution is not as simple as it seems. The following one, commonly used in im- 
plementations, is wrong: x{M/x} = M, y{M/x } = y, if y ^ x, (Mi M 2 ){M/x} = (Mi{M/x} M 2 {M/x}), 
(A x.N){M/x} = Ax. AT, and (\y.N){M/x} = \y.N{M/x}, if y ^ x. The problem arises in the last 
case: the term M may contain a free variable y which becomes a bound variable when the substitution 
is applied. A correct definition should avoid this capture; for instance, by modifying the last case with 
(A y.N){M/x} = A z.N{z/y}{M/x}, where 2 g- TV (M). 

The A-calculus is not terminating. Indeed, a term like (Ax.(x x) Ax.(x x)) /3-reduces to itself and then 
it can be always reduced. However, the A-calculus satisfies, the Church-Rosser property i.e., if Mi =p v M 2 , 

then there exists N such that Mi ► N and M 2 - — ► N. 1 In consequence: (1) the A-calculus is also 

confluent and (2) normal forms, it they exist, are unique. We refer to [3] for a complete description of the 
A-calculus and its properties. 


x As usual, if it is a term rewrite system, we denote by — A- the relation induced by Ft and by — — ► the reflexive, 

symmetric, and transitive closure of ► . Furthermore, the equational theory associated to R defines a congruence denoted 

by =r- 
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2.1. de Bruijn indices. At the beginning of the seventies, de Bruijn developed a nameless notation 
for the A-calculus [19]. In that notation, names of bound variables are replaced with indices. 

Definition 2.1. The set A^b of A-terms in de Bruijn index notation is defined inductively as 

M,N ::= n \ (M N) | AM 


where n G N >0 . 


An index counts the number of A-symbols in the binding scope of the bound variable that it represents. 
For instance, in de Bruijn index notation, the term Xx.x is written A1 since the bound variable x is in the 
binding scope of one A-symbol. Similarly, the term Xx.(Xy.(x y) x) is written A(A(2 1) 1). Note that the 
same index appearing in different binding scopes represents different variables. Vice-versa, occurrences of 
the same variable appearing in different binding scopes are denoted by different indices. 

Free variables can also be represented by de Bruijn indices. In that case, it is necessary to fix an 
enumeration, namely a referential , Xi,X 2 , ■ ■ ■ ,x n , of free variable names. If the occurrence of a variable is 
denoted by an index n and the number of A-symbols in the binding scope of that occurrence is less than n, 
say m, then that occurrence of n represents the free-variable x n - m of the referential. For instance, the term 
(A x.(y x) z) can be encoded as (A(2 1) 2) under the referential y, z and as (A(3 1) 1) under the referential 

z,y- 

The formulation of the rules (5 and q for A dS -terms requires the following functions for updating and 
substitution of indices. 


Definition 2.2. Let M G A dB- The -i-lift of M, denoted M + * is defined inductively as follows 

1. (Mi M 2 ) +i = (M+ i M 2 +i ); 

2. (XN) +i = XN +( - i+1 T 


3. n +i = 


n + 1 , if n > i 
n, if n <i 

The lift of a term M is its 0-lift and is denoted briefly as M + . 

Definition 2.3. The application of the substitution with N at the depth n — 1 on a term M, denoted 
M{N/n}, is defined inductively as follows 


1. (Mi M 2 ){N/n} = (Mi{N/n} M 2 {N/n}); 

2. (XM){N/n} = XM{N+/n + 1}; 

{ m — 1 , if m > n 
N , if m = n 
m, if m <n 

Definition 2.4. The rules ft and q are defined for the set of Ajb - terms as follows 


(AM N) M{A7l} 

A (M 1) -TU AT, if N+ = M 


Example 2.5. The A-term (A.T.(At/.(.T z) x) (z X z.(x z))) can be translated under the referential 
x,y,z into the A^-term (A(A(2 5) 1) (3 A(2 1))). Furthermore, we have 

(Xx.(Xy.(x z) x) (z X z.(x z))) ► (Xy.((z X z.(x z )) z) (z X z.(x z))). 
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We examine in detail the steps of that reduction for A^-terms: 


(A(A(2 5) i) (3 A (2 i))) (A(2 5) 1){(3 A(2 l))/i} 

= ((A(2 5)){(3 A (2 1))/1} 1{ (3 A(2 1))/1» 

= (A(2 5){(3 A (2 D)+/2} (3 A(2 1))) 

= (A(2 5){(3+ A(2 +1 l +1 ))/2) (3 A(2 1))) 

= (A(2 5){(4 A (3 l))/2} (3 A(2 I))) 

= (A(2{(4 A (3 l))/2} 5{(4 A(3 l))/2}) (3 A(2 1))) 

= (A((4 A (3 1)) 4) (3 A (2 1))) 

The A^B-term (A((4 A(3 1)) 4) (3 A(2 1))) represents the term (At/. ((z A z.(x z )) 2 ) (2 A 2 .(a; 2 ))) under 
the given referential. • 

Example 2.6. Notice that 

A((AA(5 (1 2)) 4) I) (AA(4 (1 2)) 3) 


since 


(AA(4 (1 2)) 3)+ = ((AA(4 (1 2)))+ 3+) 

= (A(A(4 (1 2))) +1 3+) 

= (AA(4 (1 2))+ 2 3+) 

= (AA(4 +2 (1 2)+ 2 ) 3 + ) 

= (AA(4 +2 (1 +2 2 +2 )) 3 + ) 
= (AA(5 (I 2)) 4) 


2.2. Simply-typed A-calculus. The A-calculus is a simple, but yet powerful formalism. As we said 
before, when used as a logical framework, the A-calculus allows the encoding of paradoxes. To solve that 
problem, Church developed a typed version of the A-calculus [11] which happens to be a simplification of the 
Type Theory of Whitehead- Russell [58]. 

The effect of typed A-calculus can be seen on a term such as \x.{x x) which is a well formed term in the 
untyped A-calculus that represents the abstract concept of “self-application”. The meaningfulness of this 
concept may be questioned and was involved in many of the logical paradoxes from the beginning of the 
twentieth century. Thinking about x as a functional variable from A to A or of “ type A— >-.4”, the application 
(x x ) is forbidden, since it’s impossible to apply a function of type .4 — k 4 to an argument of type A— tA. 
This coincides with the conception of functional objects assumed by most mathematicians. Of course, if 2 
is a variable of type A , the typed expression Aa;.(.T (.t 2 )) makes sense. For a formal introduction to the 
theory of the simply-typed A-calculus and interesting historical remarks see [30]. 

In a typed A-calculus, A-terms are stratified in several categories, namely types. A type, in the simple 
type theory , can be a basic type a, b, . . . or a functional type A-^B, where A and B are types. We use 
upper-case letters A, B ... to range over types. Only terms that follow a type discipline are considered to be 
valid. The type discipline is enforced by a set of typing rules. Thanks to the typing rules, Russell’s paradox 
cannot be expressed in the simple type theory. 
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— , X -t (Start) 

x:A , , h x : A v ’ 


x:A , , b M : B 
b Xx:A.M : A-fB 


(Abs) 


x £ , 

x:A,, 


, b M : B 
I -M :B 


(Weak) 


, I - M : A-^B , h N : A 

, I - {M N):B 


(Appl) 


Fig. 2.1. The simply-typed, A -calculus 


1 <i <n 

A\ .A % .... A n b i : A} 


(Var) 


A., b M :B 
b Xa-M : A-tB 


(Abs) 


b M : A-^B 


\- N : A 


b (M JV) : B 


(Appl) 


Fig. 2.2. The simply-typed A -calculus for A^g-terms 


Typed A-terms are elements of the set of A-terms except that bound variables in abstractions have type 
annotations, i.e., they have the form A x:A.M. Rules (3 and r) are modified accordingly: 

(Xx : A.M N) — M{N/x} and Ax : A.(M x) — M, if x £ TV{M) 

A typing judgment , b M : A denotes that the term M has type A in „ where , is a context , i.e., a list 
Xi:Ai,..., x n :A n of variable declarations. Henceforth, we use Greek letters ,, A, ... to range over contexts. 
Figure 2.1 shows the typing rules of the simply-typed A-calculus. We say that a A-term M is well typed in 
, if and only if there exists a t ype A such that , b M : A , and we say that a type A is inhabited in , if and 
only if there exists a A-term M such that , b M : A. 

The presentation of the typed A-calculus used in this paper corresponds to the Church-style. In this 
presentation, typed A-terms are elements of the set of A-terms except for abstractions, which have type 
annotations. An alternative presentation, called Curry-.style , considers typed A-terms as standard A-terms 
without type annotations. In that case, type variables should be added to the formalism. Indeed, in a typed 
A-calculus a la Curry, the type of Ax.x is a-ta where a denotes any type (See [4]). 

Type checking is decidable for the simply typed A-calculus. That is, there is a method to decide whether 
or not a term has a type in a given context according to the typing rules. As the untyped version of the 
A-calculus, the simply-typed A-calculus enjoys the Church-Rosser property and therefore it is also confluent. 
Furthermore, it also satisfies the following properties. 

Qji 

• Subject reduction , if , b M : A and M ► N , then , b N : A; 

• Type uniqueness, if , b M : A and , b M : B, then A = B; 

• Strong normalization, if M is a well typed term, then M has no reductions of infinite length. 
Therefore, due to the confluence property, normal-forms of well typed terms always exists and they 
are unique. 

In the de Bruijn setting of the simply typed A-calculus, a context , is a list of t ypes Ai A n where A i 

is the type of the free- variable represented by the index i. The empty context is denoted by e. Simply-typed 
A^s-terms are defined by the typing rules of Fig. 2.2. 

2.3. Curry-Howard isomorphism. There is a strong relation between type theory and intuitionistic 
logic. If we identify types with propositions, where an arrow type is an implication, typing rules of the simply- 
typed A-calculus correspond one to one to deduction rules of a minimal intuitionistic logic. In other words, 
typing rules are logical rules decorated with typed A-terms. This principle is known as the Curry-Howard 
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isomorphism. 

Consider an intuitionistic minimal logic where propositional formulas are built from atomic propositions 
a,b,... and the implication, i.e., if A and B are formulas then A— is a formula. We use uppercase Greek 
letters ft to range over set of formulas. We write ft, A as a shorthand for ft U {A}. A judgment ft l-j A 
denotes that A is a logical consequence of ft. A judgment is said provable (in the minimal intuitionistic 
logic) if and only if it is derived by top-down application of the following rules: 


fl,dbA (Axiom) 


fl,A\~iB 
ft F7 A->B 


(Intro) 


ft h i A—±B 

fTFTB 


flfjA 


(Elim) 


A formula A is a tautology if and only if the judgment b / A is provable. For example, the formula 
A— t((A— >B)— >B) is a tautology since it can be derived as follows: 


A, A-tB hj A-^B 


(Axiom) 


A, A^fB I -/ A 


A, A^B \~i B 
A bj (A-iB)-iB 
b, A->((A->B)->A) 


(Intro) 

(Intro) 


(Axiom) 

(Elim) 


Formally, the Curry-Howard isomorphism says that ft b/ A is provable in the minimal intuitionistic 
logic if and only if , b M : A is a valid typing judgment in the simply-typed A-calculus, where , is a list 
of variable declaration of propositions, seen as types, in ft. The term M is a A-term that represents the 
proof derivation. For instance, the term decoration of the tree derivation above results in the valid typing 
judgment b \x:A.\y:A-*B.(y x) : A-¥((A-¥B)-¥A). 

The Curry-Howard isomorphism is extended to intuitionistic first order and higher order logics and 
it is widely studied in proof theory. It is at the base of mathematic formalizations where proofs are just 
mathematical objects. Such languages are the base of automatic systems for proof construction, program 
verification and program synthesis. 

3. Explicit Substitutions. Implicitness of substitution is the Achilles heel of the A-calculus. Namely, 
the A-calculus is a convenient and compact model of the computable functions but it does not provide any 
mechanism for observing essential operational properties of these functions as time and space complexity. 
The reason for this is that the substitution involved in /^-reductions does not belong in the calculus, but rather 
in an informal meta-level. In practice, /3-reduction is not a primitive operation and is implemented based 
on a substitution generally elaborated by renaming variables and/or maintaining some variable convention. 
That makes it impossible to determine or bound in time and space the /3-reduction. 

The Aa-calculus was the first one presented formally as a mechanism for making explicit substitution 
in the A-calculus [1]. But before this, today widely considered seminal work, many empiric and theoretic 
efforts were realized in order to solve the problem of implicitness of the substitution operation. From the 
theoretical point of view, the Combinatory Logic of Curry and Feys [18] proposed the first solution to this 
problem. However, this setting does not remain close to the A-calculus and the number of primitive steps 
can be extensively larger than required by explicit substitution calculi. From the empirical point of view, 
perhaps the person who provided the foundations to take care of this problem was de Bruijn himself, when 
developing his system AUTOMATH from the middle of the 1960’s. Part of his primary conceptions was the 
previously mentioned nice nameless notation for the A-calculus [19]. His legacy is collected in [50]. 
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Since the Aa-calculus was introduced in [1], several other variants of explicit substitution calculi have 
been proposed (see, for example, [54, 38, 32, 7, 39, 17, 35, 43, 24, 44]). These calculi implement several styles 
of explicit substitutions. 

We will focus our attention on two of these styles: the A a- and the As e -styles. Both of them use a nameless 
notation based on the de Bruijn index notation, which is completely insensitive to a-conversion. That allows 
a clean and elegant meta-theoretical study of the calculi which make them suitable for implementation of 
declarative programming languages, higher order proof assistants, and automated deductive systems. Both 
styles were shown incomparable in [34]. 

The Aa-calculus and its variants have been proposed as a general framework for higher order unification 
and term synthesis [21, 22, 9, 36, 45, 47, 46, 6]. Furthermore, calculi of the Aa-family have been incorporated 
with success into programming languages and proof assistants. For example, an algorithm for pattern 
unification for dependent types, based on A a, has been implemented in the Twelf system [52]. It has also 
been relevant in the improvement of the explicit substitution for the rewrite calculus (p-calculus [14]) of the 
ELAN system, which provides a language based on rewrite rules for specifying and prototyping deductive 
systems [13]. 

The A.s e -calculus [32, 33] was developed more recently than the Aa-calculus and its main claimed ad- 
vantage over the Aa-calculus is that it remains as close as possible to the A-calculus having only one sort of 
objects. There is a close relation, until now only subjectively purposed, between the As e -calculus and the 
rewrite rules developed by Nadathur and Wilson in the early 1990’s and used in the implementation of the 
higher order logic programming language AProlog [41]. For instance the laziness in the substitution needed 
in implementations of /3-reduction, that arises naturally in the As e -calculus, is provided as the informal but 
empirical concept of suspension of substitutions by Nadathur and Wilson rewrite rules, with their notion of 
substitution being more general than the As e one. More recently their rewrite rules were published in the 
context of explicit substitution as the suspension calculus [49, 48]. Establishing formally the relations and 
differences between the As e -calculus and the suspension calculus remains as important work to be done. 

3.1. The Acr-calculus. The Aa-calculus is a first order rewrite system with two sorts of expres- 
sions: terms and substitutions. In fact, substitutions inherent to the /3-rule in de Bruijn index notation, 

(AM N ) — M{N/1}, are delayed and recorded in the Aa-calculus as (AM N) ► M[N ■ id\. Here, 

M[N ■ id] is a Aa-expression representing M with a recorded substitution N ■ id. Additional rules are nec- 
essary for applying the recorded substitution to the term M, i.e., replacing all the free occurrences of the 
de Bruijn index 1 at M with N and decrementing by one remaining free de Bruijn indices over M. De- 
laying application of substitution is widely used in implementations of functional and logical programming 
languages, because immediate substitution may give rise to a size explosion of the expressions. 

Definition 3.1 (Acr-calculus). The Xo-calculus is defined by the rewrite system depicted in Fig. 3.1 
where 


Terms M,N ::= 1 | AM | (M N) \ M[S] 

Substitutions S,T ::= id | t \ M ■ S \ S oT 

The rewrite system obtained by dropping rules (Beta) and (Eta) of A a is called a. 

In A a, de Bruijn indices are encoded by means of the constant 1 and the substitution f- We write f” as 
n-times 
/ * , 

a shorthand for f 0 • • • 0 t- We overload the notation i to represent the Aa-term corresponding to the index 


7 



(AM N) 

(M TV) [S'] 
(AM)[S] 
M[S][T } 

1 [M • S] 

M [*</] 

(Si o S 2 ) o T 
(M-S)oT 
id o S 
S o id 
fo (M • S) 
It 

i[S]-(toS) 
A(M 1) 


*\ i-e-, 


This one-shift encoding is interesting because involving a built-in deduction mechanism for arithmetic in 
implementations of systems based on the Aa-calculus makes it difficult the analysis of time and space quan- 
titative performance. But in any conceivable implementation one should use full indices at the meta-level 
instead of the one-shift encoding. 

An explicit substitution denotes a mapping from indices to terms. Thus, id maps each index i to the 
term i, f maps each index i to the term 1 + 1, S o T is the composition of the mapping denoted by T with 
the mapping denoted by 5 (notice that the composition of substitution follows a reverse order with respect 
to the usual notation of function composition), and finally, M ■ S maps the index 1 to the term M, and 
recursively, the index i + 1 to the term mapped by the substitution S on the index i. 

The Aa-calculus is not a confluent rewrite system [17], however it is confluent on ground expressions [1] 
and confluent on substitution-closed expressions (i.e., expressions without substitution variables) [54]. On 
the other hand, the a-calculus, i.e., A a without (Beta), is confluent and terminating [1]. 

A term is called pure if it does not contain substitutions. Notice that the set of pure terms in Aa and 
the set of A^s-terms are identifiable. Furthermore, the Aa-calculus simulates the A-calculus [17], i.e., the 
relations induced by — ► and ( - Beta ], a (one s tep D f (Beta) followed by a a-normalization) coincide 
on pure terms. However, the Aa-calculus does not preserve strong-normalization of the A-calculus [40], i.e., 
strongly normalizing A-terms can be reduced forever in Aa. 

3.2. The A^-calculus. As pointed out before, the one-shift encoding of indices in Aa is a theoretically 
convenient feature, but impractical for implementations. Nadathur also remarked in [48] that the non-left- 
linear rule of A a, namely (SCons), is difficult to handle in real implementations. Instead of rule (SCons), 
he suggested the meta-rule l[t ra ] • t" +1 *- t"- Since t” is a shorthand in A a, an infinite set of rules is 


M[N • id] 

(M[S] N[S\) 

AM[1 ■ (S o f)] 

M[S o T] 

M 

M 

Si o (S 2 o T) 

M[T] ■ (S oT) 

S 

S 

s 

id 

S 

N if M = a iV[t] 


(Beta) 

(App) 

(Abs) 

(Clos) 

(VarCons) 

(Id) 

(Assoc) 

(Map) 

(IdL) 

(IdR) 

(ShiftCons) 

(VarShift) 

(SCons) 

(Eta) 


Fig. 3.1. The \a -calculus [1] 


l 


1 if i = 1 
l[t n ] if i = n + 1. 
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(AM N) 
(AM) [5] 
(M N)[S] 
M[S][T } 

I [M • S'] 
M[f°] 
(M-S)oT 
f°o S 


r +i o (m • s) 

-j-n+1 q -j-m 

l-t 1 


^r-j-n+lj . -j.n+2 


A(M 1) 




M[N ■ |°] 

(Beta) 

AM[1 • (S o f 1 )] 

(Abs) 

(M[S] N[S]) 

(App) 

M[S o T] 

(Clos) 

M 

(VarCons) 

M 

(Id) 

M[T] • (SoT) 

(Map) 

S 

(IdS) 

t n o S 

(ShiftCons) 

•j-n 0 

(ShiftShift) 

-j-0 

(ShiftO) 

^•n+1 

(ShiftS) 

N if M = c jVff 1 ] 

(Eta) 


Fig. 3.2. 77ie rewrite system, A £ 


represented by this scheme. 

Non-left-linear rules are not only annoying to implement, but they are usually responsible for non- 
confluence and typing problems. Indeed, Act is not confluent [17] and it does not preserve typing in a 
dependent-type system [45], both problems because of the non-left-linearity of the calculus. 

The A^-calculus [44] is a left-linear variant of Act where f n is a first-class substitution. This allows the 
formulation of the rule suggested by Nadathur as a regular first order rule. In fact, instead of (SCons), the 
the A^-calculus has the following rule: l[t n+1 ] • T n+2 *- t n+1 . 

Definition 3.2 (A^-calculus). The Xc-calculus is defined by the rewrite system depicted in Fig. 3.2 
where 


Natural numbers 

n 

= 0 | n + 1 



Terms 

M,N 

= 1 | AM | 

(M 

N) | M[S] 

Substitutions 

S,T 

= r 1 M • 

s\ 

SoT 


The C-rewrite system is obtained by dropping rule (Beta) from A c- 

We adopt the notation i as a shorthand for l[t"] when i — n + 1. Substitutions id and f are written in 
A c as t° and t 1 , respectively. In general, t” denotes the mapping of each index i to the term i + n . Using f n , 
the scheme of rule proposed by Nadathur can be encoded in a. first order rewrite system. Natural numbers 
are constructed with 0 and n + 1. Arithmetic calculations on indices are embedded in the rewrite system. 

The A^-calculus is confluent on substitution-closed expressions and it simulates the A-calculus [45]. Just 
as Act, it does not preserve strong normalization. 

Another left-linear variant of Act is the Ao^-calculus [17]. The Ac^-calculus is a confluent first order 
rewrite system, i.e., it is confluent on presence of both term and substitution variables. However, Act^ 
raises some technical problem with reconversions due to the fact that substitutions id and 1 • f are not 
Ao^-convertible. 

3.3. The As e -calculus. The As e -calculus avoids introducing two different sets of entities as the Act- 
calculus does, insisting in this way on remaining close to the syntax of the A-calculus. Next to abstraction 
and application, the A.s e -calculus introduces substitution (ct) and updating (ip) operators. 
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(AM N) 

(AM) o'N 
(Mi M 2 )a i N 


na l N 

<4 (AM) 

V\ (Mi M 2 ) 

<4 a 

(M\ <j 1 M 2 ) <j 3 N 
(<p\ M) a j N 

(<4 M ) ^ ^ 

ip[(M a j N ) 

<4 (4*4 
<4(4*4 

A (M 1) 


Definition 3.3 (As e -calculus). The \s e -calculus is given by the rewrite system in Fig. 3.3 and the 
grammar 

M,N ::= n \ (M N ) | AM | Mo'^N \ p\M for n, j, i > 1 and k > 0. 

The calculus of substitutions associated with the X-s e -calculus, namely s e , is the rewriting system generated 
by the set of rules s e = A -s e — {a -generation, Eta}. 

Intuitively, the substitution operator, a, initiates (rule (a-generation)) one-step of /3-reduction, from 
(AM N), propagating the associated substitution innermost (rules (cr-A) and (a-app-transition)). Once this 
propagation is finished, when necessary, the updating operator, ip, is introduced to make the appropriate lift 
over N (rule (a-destruction)). Otherwise either free de Bruijn indices are decremented by one or bounded 
maintained. 

The As e -calculus simulates /3-reduction and is confluent [33]. It does not preserve strong normalization 

[28]. 

3.4. Simply-typed calculi of explicit substitutions. In this section, we only include the essen- 
tial notation of the simply-typed A £- and A.s e -calculi. Properties can be found in detail in [44] and [32], 
respectively. Typing rules in both calculi follow the scheme as those of the simply-typed Aa-calculus [21]. 

The rewrite rules of the typed A cr and As e -calculi are defined by adding to their respective set of rules 
the necessary typing information. Thus, for the simply-typed A^-calculus we have the typed rules: 

(A a-M N) — ► M[N- 1°] (Beta) 

(X a .M)[S] — ► X a .M[1 - (S o f 1 )] (Abs) 

X a .(M I) — ► N if M =c TVtt 1 ] (Eta) 
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X(Mo i+1 N) 

((Mi a { N) (. M 2 a i N )) 

n — 1 if n > i 
ip l 0 N if n = i 
n if n < i 

A (<4+i *4 

((<4 M i) (<4*4)) 

{ n + i — 1 if n > k 
n if n < k 

(Mi a J+1 N) <r i (M 2 a j - i+1 N) if i < j 
ip}- 1 M if k < j < k + i 
ip\ (M N) if k + i < j 

(<4+i M) a* (4+i-i N ) if j <k + 1 

4 (4+i-j *4 if l+j<k 
pj +l ~ l M if l < k < l + j 
N if M =, e <f$N 


(u-generation) 

(a-A-transition) 

(a-app-transition) 

(a-destruction) 

(y>-A-transition) 

(y>-app-transition) 

((^-destruction) 

(a-a-transition) 
(a-ip- transition 1) 
(a-<^-transition 2) 
(</>-a-transition) 
(y>-y?-transition 1) 
(<^-<^-transition 2) 
(Eta) 


Fig. 3.3. Rewriting system of the \s e -calculus 



(Var) 


A., I-I: A 

, h M : A-A-B , h N : A 
, h (M N) : B 


(App) 


(Id) 


, H f » , 

, h M:A , h S' > A 
, h M • S i> A. A 


(Cons) 


A., h N:B 
, \~X a .N:A^B 

, hS> A Ah M : A 
, h M[S ] : A 

, h t” > A 


(Lambda) 
(Clos) 


(Shift) 


a., h r +1 > a 

, h T t> A 2 A 2 h S > Ai 
, hSoTt>Ai 1 


(Comp) 


Fig. 3.4. Typing rules for the A ^-calculus 


A., hl:hL (Var) 
A., h N : B 


, hn:B 


(Yarn) 


(Lambda) 


, h \a-N : A—±B 

, >jh N : B , <j . i?., >j h M : d /c . 


A., h n + 1 : .B 

, h A : A-lE , h M : A 
, h (N M) : B 


(App) 


, h M a l N : A 


(Sigma) 


<fc-> >fc+i M : A /pi 
i h ip\M : A 1 j 


Fig. 3.5. Typing rules for the Xs e -calculus 


and for the typed As e -calculus: 


(A a-M N) 
(A a-M) g'N 

<4 (A A-M) 

Xa-(M 1) 



M a 1 N 
\ A .(M a i+1 N) 

XaM + i m ) 

AI if M = Se 4AI 


(a-generation) 

(u-A-transition) 

(y>-A-transition) 

(Eta) 


Typing rules for the A^-calculus and the A.s e -calculus are presented in the Figures 3.4 and 3.5, respec- 
tively. Notice that in the case of the A/; -calculus, substitutions receive contexts as types. This is denoted as 
, h S t> A. Let , be a con text of the form Ai.A 2 ...A„.A. We use the notation , <*, and , >*, for denoting 
the contexts A] ... A^ and A/,...A n . A, respectively. This notation is extended for “<” and “>” in the obvious 
manner. 

Example 3.4. In order to illustrate the use of the typing rules, we show how to infer the type of the 
term \ A ->b-Xb->c-X a .(2 (3 1)) in A s e . 

For short, let , = A.B^-C.A-a-B. Firstly, observe that 


(1) , hi: A (Var) 
Then, we have 


B^C.A^B\-\ : B^C 
(2) , h 2 : B-tC 


(Var) 

(Yarn) 


A-^rB h 1 : A-^rB 


(Var) 

B->C.A->B h 2 : A-A-B j}J arn ) 
(Yarn) 


(3) , h 3 : A—fB 


(2) 


(3) (1) 

1 h (.3 1) : B 


h (2 (3 1)) : C 


(App) 

(App) 
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Finally, notice that 


I" (2 (3 1)) : C 


(Lambda) 


B-yC.A^B h A j 4.(2 (3 1)) : A-+C 
A-aB h A B ->c-Aa-( 2 (3 1)) : (R-^j-KA-H?) 


(Lambda) 


(Lambda) 


F \a^b Ab~>c -^a-{2. (3 1)) : (A-y-B)—y-((B—^C)—r(A—^C)) 

For the A^-calculus the inference is identical except for the first steps; for instance, notice that 


B^-C.A-yB 1- f° > B-y-C.A^-B 


F t 1 > B^-C.A-^B 


(Id) 

(Shift) 


A->B h t° > A-+B 


(Id) 


B^C.A-^B h t 1 > A-yB 


h t 2 > A-tB 


(Shift) 

(Comp) 


Then, 


h t 2 > A-^yB A—±B h 1 : A-^yB 
, h 2,:A^yB 


(Var) 

(Clos) 


Remember that the language of the A/;-calculus only includes the de Bruijn index 1 and the others are 
simulated using the t n - • 

The simply-typed versions of the Xc~ and As e -calculus satisfy, among others, the properties of subject 
reduction and type uniqueness. Additionally, they are Weakly Normalizing (WN) and Church-Rosser (CR). 

4. Applications. Although in an intuitionistic logic, the concepts of propositions and types are iden- 
tified, proof construction and term synthesis do not necessarily go in the same direction. For instance, to 
prove the proposition A-^(B-yA), one may assume A as an hypothesis and then, recursively, try to prove 
(B-tA). Eventually, one gets the axiom A, B F A and the proof derivation is completed. On the other 
hand, the proof synthesis procedure decorates with A-terms the proof-tree derivation from the axioms, i.e., 
x:A,y:B F x : A, down to the conclusion, i.e., F \x:A.\y:B.x : A-t(B-+A). 

In order to synthesize a A-term at the same time as a proof is being developed, it is necessary to represent 
incomplete-proofs. Assume, for example, the proposition A-¥(B-tA). The bottom-up application of the rule 
(Abs) results in a term Xx.A.X where X is a term to be constructed of type (B—tA). A term as A x:A.X 
is called an open term and the place-holder X denotes a hole to be filled with a term of the right type, in 
this case of type (B-tA). Place-holders are also called meta-variables to distinguish them from the variables 

of the A-calculus. Meta-variables are written as uppercase last letters of the Latin alphabet: X, Y, At 

some moment during the proof derivation, we get the typing judgment x:A , , F A y.B.x : (B—tA). Hence, 
to obtain a close term, i.e., a term without meta-variables, we can instantiate the meta-variable X with 
the term A y.B.x. This results in Xx:A.Xy:B.x. In contrast to substitution of variables, instantiation of 
meta-variables is a first order replacement that does not take care of renaming of bound variables or capture 
of free-variables. 

Notice, however, that open terms are not A-terms. In fact, (1) instantiation and /3-reduction do not 
commute, and (2) instantiation and typing do not commute. To illustrate the first point, take the open term 
(Xx.X y) and the instantiation of X with x. The instantiation results in (Aana: y), which /3-reduces to 
y. However, the original term /3-reduces to X, which gets instantiated as x. To see why instantiation and 
typing do not commute, consider the context , = x:A, z:(B— >-A )— >C and the open term (z Xx.B.X) of type 
C , where X is a meta-variable of type A. If we instantiate X with the variable x of „ then w e obtain the 
ill-typed term (z Xx:B.x). 
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Meta-variables can be encoded in classical A-calculus by using a technique taken from the higher order 
unification tradition [31]. This technique uses a functional handle of scope. For instance, the open term 
\x:A.Y, where Y is a meta-variable of type B , is encoded as the A-term A x:A.(y x), where y is a fresh 
variable of type A—tB. In this case, the information that the variable x can indeed occur in a subsequent 
substitution of y is taking into account by the application (y x). Thus, an instantiation of Y with M in the 
original problem is translated as a substitution of y by A x\A-^B.M in the A-calculus. Notice, however that 
the meta-variable Y has the type B while the corresponding variable y has the type A^B. 

Explicit substitutions and de Bruijn indices allow a simple and natural notation for open terms. First, 
in a de Bruijn setting, meta-variables are just variables of the free algebra of terms. Notice that bound and 
free variables of the A-calculus are represented as indices. And second, explicit substitution calculi as Act, 
A c, and A s e , are confluent on open terms (in the case of Act and A c, on substitution-closed terms). Thus, in 
these calculi, commutation of instantiation and the /3-reduction is for free. 

We will consider meta- variables over a set A. 

Definition 4.1. The set A dB(X) of A-terms in de Bruijn index notation with meta-variables over the 
set X is defined inductively as 


M,N ::=» | X \ (M N ) | AM 


where n € N >0 , X € X. 

Definition 4.2. A valuation is a mapping from X to A dB(X). The homeomorphic extension of a 
valuation, 6, from its domain X to the domain A ds(A) is called the grafting of 9. 

As usual valuations and their corresponding graftings are denoted by the same Greek letters. Application 
of a grafting 9 to a term M will be written in postfix notation M9. For explicit representation of a valuation 
and its corresponding grafting 9, we use the notation 9 = {A' e-i X9 \ X € Dom(9)}. A grafting is the 
formal concept for meta-variable instantiation. 

The set of Act-, A £-, and As e -terms with meta-variables, and their respective grafting notion, can be 
defined in a similar way. The typing rule for meta.- variables in these systems is [21]: 

, a- F X : A x (Metax) 

where Ax and , x are, respectively, a unique type and a unique context associate to each meta- variable. By 
using this rule, typing and instantiation of meta- variables commute [21]. 

4.1. Fligher order unification. Higher order unification (HOU) is essential in automated reasoning, 
where it has formed the basis for generalizations of the Resolution Principle in higher order logics, being 
a sine qua non mechanism in the implementation of higher order proof assistants and higher order logic 
programming languages as the ones previously referenced. For a very simple presentation of HOU see [57] 
and for a detailed introduction in the context of declarative programming see [53]. As for the first order 
case, substitution is the key operation for HOU and its implicitness makes difficult the analysis of important 
computational properties. Therefore, use of calculi of explicit substitution in the formal implementation of 
HOU procedures is relevant. 

HOU problems are expressed in the language of the simply-typed A-calculus in de Bruijn indices over 
a set of meta- variables X, denoted A dB(X). Meta- variables play the role of unification variables. A simple 
example of a HOU problem is to search for function solutions F of the equality F(f(a)) = ? f(F(a)). That can 
be written in A^b{X) as (X (2 1)) =f n (2 (A 1)), where both X and 2 are of functional type, say A— >A 
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and 1 of atomic type A . A solution for X is the function identity, A^T but {A^.Qi 1), A 4 .(3 (3 1)),...} 
(correspondingly, {F = f,F = f 2 , . . .}) are solutions too. 

The first person to present a HOU algorithm of practical interest was Huet [31]. Huet’s work was 
relevant because he realized that to generalize Robinson first order Resolution Principle [55] to higher order 
theories it is useful to verify the existence of unifiers without computing them explicitly. Huet’s algorithm is 
a semi-decision one that may never stop when the input unification problem has no unifiers, but when the 
problem has a solution it always presents an explicit unifier. Unification for second-order logic was proved 
undecidable in general by Goldfarb [26]. Goldfarb’s proof is based on a reduction from Hilbert’s Tenth 
Problem. This result shows that there are arbitrary higher order theories where unification is undecidable, 
but there exist particular higher order languages of practical interest that have a decidable unification 
problem. In particular, for the second-order case, unification is decidable, when the language is restricted to 
monadic functions [23] . Another problem of HOU is that the notion of most general unifier does not apply 
and that a notion more complex than the one of complete set of unifiers is necessary. Huet has showed 
that equations of the form (A x.F a) = ? (A x.G b ) (called flex-flex) of third-order may not have minimal 
complete sets of unifiers and that there may exist an infinite chain of unifiers, one more general than the 
other, without having a most general one (for references see section 4.1 in [53]). 

The general method of HOU via calculi of explicit substitutions was introduced in [ 21 ] (for the A a- 
calculus) and consists mainly in: firstly, a translation or “pre-cooking” from HOU problems in A d B(X) into 
the language of a calculus of explicit substitutions. Secondly, an application of (first order) unification in the 
selected calculus of explicit substitutions to solve the translated problems. Finally, translation back of the 
given grafting solutions into substitution solutions of the original HOU problem. In this way HOU problems 
are solved via first order unification in the language of calculi of explicit substitution. We will explain with 
examples how reduction relations from the simply-typed Acr-calculus and As e -calculus of explicit substitutions 
are used to solve HOU problems in A d B(X). For a formal presentation of the methods consult [21] and [2], 

Definition 4.3. Let 9 = {A'i e->- a\,...,X n e->- a n } be a valuation from the set of meta-variables X 
to A d s{X). The corresponding substitution, {oi/A'i, . . . ,a n /X n }, also denoted by 0 but written in a prefix 
notation, is defined inductively as follows 

1. 9(m) = m, for m £ N; 

2. 6{X) = X{X 1 H- d , . . . , X n 1-4 a„j, for X £ X; 

3. 9{ai o 2 ) = (0(ai) 0(a 2 )); 

4 ■ 9( Aai) = A# + (ai); 

where 8 + denotes the substitution corresponding to the valuation 9 + = { Xi 1 -tol,..., X n i-a a+}. 

Unifying two terms M and N in A d /j(X) consists in finding a grafting 9 such that its corresponding 
substitution satisfies 9(M) 9(N). Notice that application of a grafting has a different effect to the 

application of its corresponding substitution. For instance, although (AA){A' h-> M} = AM, a unifier of the 
problem AA' AM is not {M/A'}, since (AX){M / X} = A(X{M+/A'}) = A M + . However, by translating 
appropriately the A d B ( A)-terms M, N, the HOU problem M N can be reduced to first order unification 
either in the Act- or in the As e -calculus. Essentially, the pre-cooking translation from terms in A d s(X) into 
the language of the Au-calculus replaces each occurrence of a meta- variable X with A'[f fc ], where k is the 
number of abstractors above the occurrence of X. For the case of the As e -calculus the pre-cooking translates 
each occurrence of a meta- variable X into ip^ +1 X, where k is as before. 

Example 4.4. Consider the problem 2 =/ J) (X 2) being 2 of type A and X of type A - tA . Introducing 
a fresh meta-variable Y of type A the problem is translated into 2 (AU 2) A X =p AY. 
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In the As e -calculus the problem is normalized into 2 =\ a Y a 1 2 AX =\ s A Y, whose solutions are {1 /Y} 
and {3/Y } giving as result the solutions {Xl/X} and {A3/A'}. 

In the Aa-calculus the problem is normalized into 2 =\ c Y[2.id\ A X =\ a A Y, from which we infer the 
solutions above. • 

Example 4.5. 

Now consider the HOU problem 2 (A Z 2), where 2 and Z are of type A. 

In the As e -calculus the problem is pre-cooked into 2 =\ s (X<PqZ 2) and then transformed into 2 =\ s 
((PgZ)a 1 2 and subsequently into 2 =\ Se (fl Z by normalization. The sole possible solution given is {Z i-4 2}. 
Observe, on the one side, that (Xip^Z 2 ){Z 2} = (A<pg2 2) =\ Se (A3 2) =\ Se 3a 1 2 =* Se 2. On the other 

side, turning back the pre-cooking transformation, this corresponds to the substitution solution (2 /Z] for 
the original problem. In fact, (XZ 2){2/Z} = ((XZ){2/Z) 2{2 /Z}) = (X(Z{2+/Z}) 2) = (A3 2). The 
previous term /3-reduces into 2. 

In the Aa-calculus the problem is pre-cooked into l[t] —\ a (AZ[t] I[t]) which Aa-reduces into l[t] =\ a 
(Z[t])[![t]-*d] and subsequently into l[t] —\ a Z[f o(l[f].*d)] and into l[f] Z[id\ and finally into l[f] =\ a 
Z giving the corresponding sole solution {Z ia l[t]}- This corresponds to the above grafting solution in 
As e . On the one side, (XZ[\\ l[t]){Z ^ l[t]} = (A((l[t])[t]) l[t]) =a<t (Al[t 2 ] l[t]) =a<t l[t 2 ][l[t ]-id\ =a<t 
l[f o(l[f].*d)] =a <7 l[t]- On the other side, turning back the pre-cooking transformation, this corresponds 
to the substitution solution {2/Z} for the original problem in A ds(T) as above. 

Notice that {1 /Z} is not a substitution solution of the previous problem, since for any de Bruijn index 
n we have (A Z){n/Z] = A (Z{n + /Z}) = A (n + 1 ). • 

The following example illustrates why pre-cooking of A-terms before applying unification rules is essential. 

Example 4.6. (Continuing example 4.5) In the As e -calculus, when normalizing the HOU problem 
2 =^ J) (XZ 2) before pre-cooking we obtain 2 =\ St Za 1 2, whose solutions are the graftings {Z i-a 1} and 
{Z i-a 3}. As previously mentioned {l/Z} is not a substitution solution of the original HOU problem. 
Analogously, in the Aa-calculus, when normalizing the corresponding problem l[t] =\ a (XZ l[t]) we obtain 
l[t] =\ a XZ[l[\\.id\, whose solutions are {Z i-a 1} and {Z i-a 1[T 2 ] } given rise to the same problem. • 

4.2. Type inference. In order to infer types of A-terms (or Aa-terms or As e -terms) we deal with new 
sets of type variables Ti and context variables 7 N. Essentially, we will take as input of a type inference 
problem a term without knowing its type and context and as output we will formulate a first order unification 
problem on type and context variables. Well-typedness of the input term will then correspond to solvability 
of the generated first order unification problem. Here we illustrate the general method mentioned above 
using the language of the As e -calculus. Simple modifications according to the typing rules of the selected 
language will adapt this method to other settings. 

Let M be a As e -term. Initially, we introduce new variables for the type and for the context of each 
subterm of M. Then M can be seen as a new term M' with all its subterms decorated with one different 
type variable as subscript and one different context variable as superscript. 

Example 4.7. (Xa-(X B -(Xc-(ZJ-1 (3% > where t* and 7*, i = 1,..., 8 are new mutually 

different type and context variables, is the decorated version of the A-term A^.Ab-Ac'.^ (3 1)). • 

Afterwards, we apply the set of transformation rules in Table 4.1 for pairs of the form (R,E), where R 
is a set of decorated terms and E a set of equations on type and context variables. The application of these 
transformation rules begin from the par (f? 0 , 0), where f?o is the set of all decorated subterms of M'. 
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Table 4.1 

Transformation rules for type inference in the A s e -calculus 


(Var) 

<f?U {!?},£} 

(Varn) 

(RU{n)},E) 

(Lambda) 

(RU{(\ a .M%)%},E) 

(App) 

(RU{(M% N%)H},E) 

(Sigma) 

(RU{{M%v i N%)%},E) 

(Phi) 

(i?U{(<4M£ )£},£) 


(Meta) (RU{XJ},E) 


-¥ ( R , E u {7 = t. 7'}), where 7' is a fresh context variable; 

-t {R,E U {7 = T[...T' n _ x .T. 7'}), where 7' and are 

fresh context and type variables; 

-t {r 2 = A->-n,7i = A72}); 

-t (f?, £U {71 = 72,72 = 73, Ti = T2-J-T3}); 

-»• = t 3 ,7i = t[... 7-2.72, 73 = ^...T.Li.72}), 

where rf, r'_, are fresh type variables and in the case 
that i = 1 the sequence t{ is empty; 

-»■ (#, E U {r, = r 2 , 72 = ri...rj;_ N _ 1 .7 , ,7i = 

where 7' and are fresh context and type vari- 

ables and in the case that k < 1 respectively k = 0 and 
i = 1 the sequences respectively r[...r' k+i _ 1 are 

empty; 

-t {R, E U {7 = , x 1 t = A x }), where , x F X : A x \ 


Notice that the transformation rules in the Table 4.1 are built according to the typing rules of the As e - 
calculus. After the application of each of the transformation rules the size of the current set of decorated 
subterms R decreases by one. Consequently, the application of these rules beginning from the pair (i?o,0) 
finishes after a finite number of steps (exactly as many steps as subterms in M) giving as result an empty 
set of decorated terms and a set Ef of equation on type and context variables. Ef is a first order unification 
problem on type and context variables. 

Finally, our algorithm terminates by applying any first order unification algorithm to Ef. If the unifi- 
cation algorithm fails then our term is ill-typed. Otherwise, if the unification algorithm succeeds, the most 
general unifier resulting as output gives straightforwardly a context , and a t ype A such that , h M : A. 
Of course, the construction of , and A is done from the bindings given in the resulting unifier corresponding 
to the outermost context and type variables selected in the decoration of M. 

Correctness and completeness of this method is a direct consequence from the correctness and com- 
pleteness of the first order unification and of the typing rules of the As e -calculus used to construct the 
transformation rules in Table 4.1. 

Example 4.8. (Continuing Example 4.7) The initial input for the set of inference rules is (Rq, 0), where 


i*o = {2??,3?ME,(2£ (mi niVrtVriMc.mi mi 

(\ B .(\c.mi (Ml (Aa-(a_b-(Ac-(2?j (M 2 2 MWIMIMIWXI)- 


In the sequel, we show the steps of the application of the transformation rules. For convenience we 
apply the rules in an specific order (from smaller to bigger subterms), but the application of the rules is 
nondeterministic. Applying the rules in any order we will obtain different sets of equations that correspond 
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to the same unification problem. 


(i?o, 0 ) -t Var 

(Ri = R 0 \ { 17 }}> E x = {73 = T3.7] }} ->• y arn 

(R 2 = Ri\ { 27 }}, E 2 =E 1 U {71 = t [ .n .7^}) -t y arn 

(R 3 =R 2 \ {3?}}, E 3 = E 2 U {72 = T2.r3.T2.73}) -+App 

(i?4 = Rs\ {(37} 171)7}}, E * = E 3 U {72 = 73 , 73 = 74, T 2 = T 3 -m}) -+App 

(R 5 =Ri\{ mi (37 2 2 17| )? 4 4 )7 5 5 } = -B4 u {71 = 74 , 74 = 75 , Ti = T4 -)-t 5 } ) ->• Lambda 

(Re = R* \ {(Ac- (27) (3?} 17})7})7})7}},£ 6 = £5 u {t 6 = C^t 5 , 75 = c. 76 }) 

<i? 7 = i? 6 \ {(A B .(A C .(27) (37} 17 3 3 )7 4 4 )7 5 5 )?®)7 7 a ^7 = ^6 u {t 7 = B^t 6 , 76 = B. l7 }) ^Lambda 


(0 = i? 7 \ {(A^.(A B .(Ac.(27) (37} i7 3 3 )7 4 4 )7})7e 6 )7})7 8 8 },^8 = ^u { t 8 = a^t 7 , 77 = A 78 }> 

Now the reader is invited to apply his/her preferred first order unification algorithm for resolving the unifi- 
cation problem E 8 = {73 = t 3 . 74,71 = t{.ti.72,72 = t}.t}.t 2 . 73,72 = 73,73 = 74, t 2 = t 3 -)-T4,7i = 74,74 = 
75, n = t 4 ->-t 5 ,t 6 = C-)-t 5 , 75 = C.^ 6 ,t 7 = B^.t 6 ,7 6 = B.^ 7 ,t 8 = A->t 7 , 77- = A.7 8 } and then to resolve 
the bindings of the resulting unifier (if it exists) for giving appropriate contexts and types for the input 
A-term. • 

4.3. Inhabitation and higher order logics. Given a type A and a context of variable declarations ,, 
the inhabitation problem consists of finding a term M such that , b M : A. Using the open term approach, 
the problem can be formulated as finding a pure instantiation for the meta-variable A' satisfying , HA': A. 
Thus, the term to instantiate A' can be constructed at the same time as the proof derivation of A by applying 
the typing rules in a bottom-up manner and introducing new meta-variables for the unknown terms. 

For the simply-typed A-calculus this problem is decidable. In fact, since provability in the minimal 
propositional intuitionistic logic is decidable, the term M can be built directly from the proof-tree derivation 
of ft h/ A, where ft is the set of types in „ as explained before. However, when we move to a first order or 
a higher order intuitionistic logic and, in consequence, we extend the type system to handle quantification, 
the problem becomes much more complicated. In [47], a. semi-algorithm to solve the inhabitation problem 
via the A^-calculus has been presented. It uses the fact that A^ is confluent on substitution-closed terms 
and weakly normalizing, even for dependent type settings of the calculus. 

Although first and higher order logics are out of the scope of this paper, we give some hints of the 
inhabitation problem for these kind of logics. See [20] for a complete description of a term synthesis algorithm 
in the Cube of Type Systems and [47] for a similar algorithm via explicit substitutions and open terms. 

The Dependent Type theory, namely An [29], is a conservative extension of the simply-typed A-calculus. 
It allows a finer stratification of terms by generalizing the function space type. In fact, in An, the type of 
a function A x\A.M is n.r:.4.i3 where B (the type of M) may depend on x. Hence, the type A^B of the 
simply-typed A-calculus is just a notation in An for the product Tlx:A.B where x does not appear free in 
B. The Calculus of Constructions, namely CC, [15, 16] extends the An-calculus with polymorphism and 
constructions of types. From a logical point of view, An and CC allow representation of proofs in the first 
and higher order intuitionistic logic, respectively. Via the types-as-proofs principle, a term of type Yix:A.B 
is a proof-term of the proposition \/x:A.B. 

Terms in these calculi can be variables, applications, or abstractions, like in classical A-calculus, or 
two new kind of terms: products (n x:A.B), and sorts {Type, Kind). Term and types belong to the same 
syntactical category. Thus, Ux.A.B is a term, as well as A x:A.M. However, terms are stratified in several 
levels according to a type discipline. For instance, given an appropriate context of variable declarations, 
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x:A, , P M : B x:A , , h B : {Type, Kind} 

, h Xx'.A.M : Bx-.A.B 


(Appl) 


(Abs) 


, I - M : n x-.A.B , \- N : A 

, FpnvJT BjTV/x) 


Fig. 4.1. Rules (Abs) and (Apl) for the CC type system 


X x\A..M : II.c:.4..B, n,c:.4..f3 : Type, and Type : Kind. The term Kind cannot be typed in any context, 
but it is necessary since a circular typing as Type : Type leads to the Girard’s paradox [25]. In Fig. 4.1 we 
give rules (Abs) and (Appl) for the CC type system. 

The A^-calculus has been extended with products for the All and (7(7-type systems in [45]. These 
variants satisfy the same properties as the simply-typed version: confluent on substitution-closed terms, 
weakly-normalizing, and subject reduction. For further details we refer to [45]. 

Example 4.9. We can proof the first order predicate (Vx.(P x))— >(P c) by finding a term A' of type 
(IIa;:A.(P x))—t(P c ) in a context where the term c has the type A and P has the type A-^tType. The 
bottom-up application of rule (Abs) results in a term A' having the form At/:(II:r:A(P x)).Y where Y is a 
term of type (P c) in a context where the variable y has the type II x:A.(P x). If we instantiate Y with 
the term (y c), which is a well typed term of type (P c), we obtain the term Xy:(XIx:A.(P x)).(y c) of 
type Bx-.(Jlx\A.{P x)).(P c). Notice that in this example we have used the meta- variables A' and Y and 
the instantiation mechanism of meta- variables to build incrementally a proof. • 

Typing of meta-variables is more complicated in dependent-type systems than in the simply-type case. 
Since meta-variables can appear in terms, types, and contexts, the typing rules should take care of possible 
circular dependences. 

5. Conclusion. The A-calculus uses an external and atomic operation to compute the substitutions 
of variables by terms. Calculi of explicit substitutions improve the substitution mechanism by allowing 
substitutions to be part of the formal language by means of special constructors and reduction rules. There 
are several versions of calculi of explicit substitutions. Figure 5.1 summarizes the main characteristics of 
some of them. All these calculi implement the /3-reduction by means of a lazy mechanism of reduction of 
substitutions. 

In this paper we have explored new developments and applications on two of the most successful styles 
of explicit substitution: Act and A s e . 
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